Content Collection for the Labeling of Health-related Web Content
Abstract
As the number of health-related web sites in various languages increases, it becomes increasingly necessary to implement control mechanisms that give users an adequate guarantee that the web resources they visit meet a minimum level of quality standards. Based on state-of-the-art technology in the areas of the semantic web, content analysis and quality labeling, the AQUA system, designed for the EC-funded project MedIEQ, aims to support the automation of the labeling process for health-related web content. AQUA provides tools that crawl the web to locate unlabeled health web resources in different European languages, as well as tools that traverse websites, identify and extract information and, based on this information, propose labels or monitor already labeled resources. Two major steps in this automated labeling process are web content collection and information extraction. This paper focuses on content collection. We describe existing approaches, present the architecture of the content collection toolkit and its integration within the AQUA system, and discuss our initial experimental results for English (six more languages will be covered by the end of the project).
Similar resources
Methodology and architecture for content collection (MedIEQ: Quality Labeling of Medical Web content using Multilingual Information Extraction; distribution: project internal)
Automating Accreditation of Medical Web Content
The increasing amount of freely available health-related web content generates, on one hand, excellent conditions for self-education of patients as well as physicians, but on the other hand entails substantial risks if such information is trusted irrespective of low competence or even bad intentions of its authors. This is why medical web resources accreditation by renowned authorities is ...
Content Collection for the Labelling of Health-Related Web Content
As the number of health-related web sites in various languages increases, so does the need for control mechanisms that give users an adequate guarantee that the web resources they are visiting meet a minimum level of quality standards. Based upon state-of-the-art technology in the areas of semantic web, content analysis and quality labelling, the MedIEQ project integrates existing techn...
Investigating Healthcare Personnel’s Satisfaction with Quality of Web-based Learning in Teaching Preventive Behaviors of Hepatitis B Virus Infection
Introduction: Acceptance and implementation of preventive behaviors through new methods by healthcare personnel are of great importance. The aim of this study was to investigate healthcare personnel’s satisfaction with quality of web-based learning in teaching preventive behaviors of hepatitis B virus infection. Methods: This descriptive study was conducted on 120 healthcare employees in Tehran ...
Trouble Spots in Online Direct-to-Consumer Prescription Drug Promotion: A Content Analysis of FDA Warning Letters
Background: For the purpose of understanding the Food and Drug Administration’s (FDA’s) concerns regarding online promotion of prescription drugs advertised directly to consumers, this study examines notices of violations (NOVs) and warning letters issued by the FDA to pharmaceutical manufacturers. Methods: The FDA’s warning letters and NOVs, which were issued to pharmaceutical companies over a...
Publication date: 2008